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Abstract 


Assessing systemic risk in financial markets is of great importance but it 
often requires data that are unavailable or available at a very low frequency. 
For this reason, systemic risk assessment with partial information is poten¬ 
tially very useful for regulators and other stakeholders. In this paper we 
consider systemic risk due to fire sales spillover and portfolio rebalancing by 


using the risk metrics defined by Greenwood et al. (2015). By using the Max¬ 


imum Entropy principle we propose a method to assess aggregated and single 
bank’s systemicness and vulnerability and to statistically test for a change 
in these variables when only the information on the size of each bank and 
the capitalization of the investment assets are available. We prove the effec¬ 
tiveness of our method on 2001-2013 quarterly data of US banks for which 
portfolio composition is available. 
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1 Introduction 


After the recent troubled years for the global economy, in which two severe crises 
(the 2007 crisis of hnancial markets and the 2010 sovereign debt crisis) have put 
the whole economic system in dramatic distress, vulnerability of banks to systemic 
events is now the main focus of a growing number of investigations of the academic 
community, across different disciplines. Simultaneously, many research efforts are 
devoted to understand the role of banks or, broadly speaking, of hnancial insti¬ 
tutions in the creation and in the consecutive spreading of systemic risk. Given 
the prominent importance of the topic and its multifaceted nature, the literature 


on evaluation and anticipation of systemic events is huge (see 

Demirgiig-Kunt and 

Detragiache, 1998 Kaminsky and Reinhart, 1999 Harrington 

2009[ 

Scheffer et al. 

2009 Barrell et al., 2010 Duttagupta and Gashin| 2011| Kritzman et al., 2011 Allen 

et al. 

2012; 

Arnold et al. 

2012 Bisias et ah, 20121 Scheffer et al. 

2012 Merton 

et al. 

2013| 

Get et al., 20R 

h among many contributions). 


Several are the channels through which hnancial distress may propagate from one 
institution to another and, eventually, ahect a vast portion of the global econ¬ 
omy. Fire sales spillovers due to assets’ illiquidity and common portfolio holdings 
are dehnitely one of the main drivers of systemic risk. Shared investments create 
a signihcant overlap of portfolios between couples of hnancial institutions. Such 
(indirect) hnancial interconnectedness is an important source of contagion, since 
partial liquidation of assets by a single market player is expected to ahect all other 

(see 
, for 

a survey of the role of portfolio overlap in spreading hnancial distress). Fire sales 
move prices due to the hnite liquidity of assets and to market impact. In a per- 


and, most importantly, negative assets’ price variations. 


fectly liquid market there will be no hre sale contagion at all (see Adrian and Shin 
for a review on the role of liquidity in hnancial contagion). Finally, leverage 


2008 


amplihes such feedbacks. In fact, as described in detail by Adrian and Shin (2010 


2014), levered institutions continuously rebalance their positions inhating positive 


market participants that share with it a large fraction of their own investments 
Corsi et ah, 2013 Huang et ah, 2013 Caccioli et ah, 2014; Lillo and Pirino[ 2015 


Assessing and monitoring systemic risk due to hre sales spillover is therefore of 
paramount importance for regulators, policy makers, and other participants to the 
hnancial markets. Greenwood et al. (2015) introduced recently a stylized model of 


hre sales, where illiquidity, target leverage, and portfolio overlap are the constituent 
bricks. They used the model to propose some systemic risk metrics. Specihcally, 


systemicness and vulnerability of a bank are dehned in Greenwood et al. (2015) as. 


respectively, the total percentage loss induced on the system by the distress of the 
bank and the total percentage loss experienced by the bank when the whole system is 


in distress. The peculiarity of the systemic measures of Greenwood et al. (2015) (and 
of any other similar approach) is that, in order to estimate both systemicness and 
vulnerability, a full knowledge of the banks’ balance sheets is needed. In practice, 
it is required to know the amount of dollars that each bank invest in each asset 
class, that is the matrix of bank portfolio holdings, whose estimation from partial 
information is the main focus of our analysis. 


Greenwood et al. (2015) applied their method to the EBA data on the July 2011 
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European stress tests, which provide detailed balance sheets for the 90 largest banks 


in the European Union. Duarte and Eisenbach (2013) exploits a publicly available 


dataset of balance sheets of US bank holding companies to apply the framework of 


Greenwood et ah (2015). They derive a measure of aggregate vulnerability that [...] 
reaches a peak in the fall of 2008 but shows a notable increase starting in 2005, ahead 
of many other systemic risk indicators. Nevertheless, the detailed information set 
required to compute such indicators is not guaranteed to be easily accessible and 
its collection can be a difficult task, especially at frequency higher than quarterly. 

This discussion leads to the main topic of this paper: how it is possible to estimate 
systemic risk due to fire sales spillover in absence of data on portfolio composition 
of hnancial intermediaries. Two possible approaches have been proposed in the 


literature. The first approach (such those proposed by 

Adrian and Brunnermeier 

2011 

Acharya et al.| 

2012 

Banulescu and Dumitrescu, 

2015 

Gorsi et al. 

2015 

) is 


purely econometric and it is typically based on publicly available data on price of 
assets and market equity value of publicly quoted hnancial institutions. Generically 
the method consists in estimating conditional variables, such as conditional Value- 
at-Risk or conditional Expected Shortfall. The econometric approach circumvents 
the unavailability of data on portfolio holdings, but pays this advantage introduc¬ 
ing a strong stationarity assumption: estimates based on the past information are 
assumed to be always good predictors of the future behavior of the system. Nev¬ 
ertheless, due to the nature of a global hnancial crisis, it is in the very moment of 
the onset of a period of distress that the stationarity assumption may fail to work 
properly. Moreover it is often restricted to publicly quoted institutions for which 
equity value are available at daily frequency. 

A second possible approactQ followed in the present paper, consists in inferring the 
matrix of portfolio holdings using only a reduced information set, and/or deriving a 
probability distribution for the portfolio weights according to some criterion. This 
is typically achieved summoning the maximum entropy principle, which postulates 


that (Anand et al., 2013) [...] subject to known constraints [...] the probability distri¬ 
bution that best represents our current knowledge and that is least biased is the one 
with maximal entropy. The approach of Maximum Entropy is not new in systemic 


risk studies 

Mistrulli 

(2011 

; Anand et al. 

(2013) 

Musmeci et al. 

(2013) 

S quart ini 

et al. 

(2013) 

; Bargigli et al. 

(2015 

). It is widely used in inferring the structure of the 


interbank network when only data of total interbank lending and borrowing (plus 
possibly other information) are available. In this context, the maximum entropy 


approach has been criticized (Mistrulli, 2011 Mastromatteo et ah, 2012) because it 


infers a fully connected interbank network, while they are typically very sparse. A 


seminal contribution by Mistrulli (2011), comparing the empirical Italian interbank 


network with that reconstructed via an optimization procedure, has shown that, as 
a consequence of this misestimation, reconstructed network underestimates the risk 
contagionl^ 


Mhere are, of course, many different approach to assessing systemic risk in financial networks. 


For example, Amini et al. (20131 propose a rigorous asymptotic theory that allows to predict the 


spread of distress in interbank networks. 

complementary method to the maximum entropy is proposed by Anand et al. (20151. Here 


the authors reconstruct the network of bilateral exposures for the German banking system via the 
matrix that, preserving some constraints, has the minimum density. Nevertheless, if maximum 
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In this paper we propose to apply maximum entropy approach to the inference of the 
network of portfolio weights in order to estimate metrics of systemic risk due to fire 
sales spillovers. Specifically, we show how indirect vulnerability, systemicness 


as 


defined by Greenwood et ah , 2015) and the aggregate systemic risk of US commercial 
banks can be estimated when only a partial information (the size of each bank and 
the capitalization of each asset) is available. Unlike Mistrulli (2011[), Mastromatteo 


et al. (2012), and Anand et ah (2015) we deal with bipartite networks, namely 
graph^ whose nodes can be divided into two sharply distinguished sets that, in 
our case, are commercial banks and asset classes. More specihcally, we analyze 
the quarterly networks of US commercial banks’ exposures in the period 2001-2013 
using the Federal Financial Institutions Examination Council (FFIEC) through the 
Call Report hle^ We compute, for each quarter, systemicness and vulnerability of 
each bank and the aggregate vulnerability of the system. We compare them with 
the values inferred assuming the balance sheet compositions of the banks were not 
known. In this sense our paper is similar to Mistrulli (2011), but applied to systemic 
risk due to fire sale spillover rather than to cascades in the interbank network. 
Differently from the interbank case, we find that maximum entropy methods are 
very accurate in assessing systemic risk due to hre sales spillover when partial 
information is available. 


The contribution of this paper is divided into three main lines. First, following a 
practice that is largely diffused among researchers of both academic institutions and 
central banks (see, among others, Sheldon and Maurer, 1998 Upper and Worms 


2004 Wells, 2004| Mistrulli, 2011; Sachs, 2014), we reconstruct the matrix of portfo 


lio holdings as such that minimizes the Kullback-Leibler divergence from a “naive” 
guess. We show that this approach, while underestimating systemic risk in in¬ 


terbank networks (Mistrulli, 2011), does a very good job in our case, providing 


unbiased estimates of the aggregate vulnerability as dehned by [Greenwood et ah] 
(2015). Besides, we show that the reconstructed matrix corresponds to that implied 
by the Gapital Asset Pricing Model, hence it possesses a clear economic meaning. 


Second, we define a statistical ensemble, that is a set of graphs and a probability 
mass function dehned on it, such that the expected value of the generic element 
of the matrix yields the value expected under the Gapital Asset Pricing Model. 
Although sizes and capitalizations are the only information required to construct 
the ensemble, we show that it does a very good job in predicting systemic risk 
metrics, not only at aggregate level, but also for each bank. The performances of this 
newly proposed approach are shown to outperform those obtained with standard 


maximum entropy approaches ( 

Saracco et al. 

2015 

), i.e. the bipartite extensions 

of the models proposed by 

Mastrandrea et al. 

(2014 

) for unipartite networks. 


Finally, we show how the statistical ensemble is potentially useful for hnancial 


entropy underestimates systemic risk by overestimating the network density, Anand et al. (20151 
show that, for a similar reason, minimum density returns positively biased estimates. Hence, the 
two approaches can be used jointly together to create a corridor in which the true systemic risk 
should lay. 

■^Throughout all the manuscript we use the terms “netw ork” and “graph” interchange ably. 

"^Hence our dataset is quite similar to that adopted by Duarte and Eisenbach (2013), but it 
profits from a larger sample of banks, since commercial banks are fairly more numerous than bank 
holding companies. All data are available at: https://cdr.ffiec.gov/public/ 
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regulators or supervisory authorities in their monitoring activities. As a matter of 
fact, the statistical ensemble implies a probability distribution for any risk metrics 
and hence it could be used to statistically test if a specific institution has increased 
its systemicness with respect to a date in the past. 


We structure our paper as follows. Section introduces some nomenclature and 
briefly describes the risk metrics of Greenwood et al. (2015). The dataset of US 
commercial banks provided by the FFIEC is discussed in Section In Section 
1^ we derive the solution of the Cross-Entropy problem and we apply it to our 
dataset. Section |5] is dedicated to the definition the statistical ensembles whose 
performances in estimating risk metrics are investigated in a comparative analysis 
reported in Section A statistical test useful for surveillance activities by central 
banks and other regulatory institutions is presented in Section Finally Section 
[^summarizes the main contributions of the paper. Appendices provide additional 
information on the construction of the dataset of bank portfolio holdings and all 
the analytical computations omitted in the main text. 


2 Systemic risk metrics: Vulnerability and Sys¬ 
temicness 


In this paper we use some metrics of systemic risk of individual banks and in 
aggregate, which have been recently introduced by Greenwood et al. (2015). Here 
the authors presented a model in which fire sales propagate shocks across balance 
sheets of banks. More specifically, they consider a system composed by N banks 
and K asset classes. 


First of all, we introduce the matrix X of portfolio holdings, whose element is 
the dollar-amount of /c-type assets detained by bank n. The corresponding matrix 
of portfolio weights is thus written as 


(X) 


^n,k 


Ylk’=l ^n,k' 


In what follows, we introduce a discretization of the elements of X’s, in such a way 
that the matrix X belongs to the space of A^ x iF integer valued matrices. 

In the empirical application we will use the resolution of the dataset which is 10^$. 

The total asset size A„ of the n-th bank and the total capitalizationj^ Ck of the fc-th 
asset class are easily computed as, respectively, the total row and column sums of 
the matrix Xn^k-, in formula 


K N 

(X) = Ck (X) = (1) 

k=l n=l 

where we have explicitly expressed the dependence of An and Ck from X. 

®More precisely, the quantity Ck is the total amount of asset’s k capitalization due to the 
banking sector. To simplify the notation we will call it capitalization. 
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The rectangular matrix X can be naturally associated to a bipartite network, i.e. 
a graph whose vertices can be divided into two disjoint sets such that every edge 
connects a vertex in one set to one in the other set, the two sets being the banks 
and the asset classes. In the network jargon. An (X) and Ck (X) are called the 
strength sequences for, respectively, the top (banks) and bottom (assets) nodes of 
the network represented by the matrix X. 


A relevant information concerning the balance sheet of each bank n is t he total eq- 
uity En, from which one can compute the leverage as Bn = (as in Greenwood 


et ah, 2015). Finally, each asset class is characterized by an illiquidity parameter 
ik, with k = 1,..., X, defined as the return per dollar of net purchase of asset ^ 


This setting is used in Greenwood et al.[ (2015) to define some metrics of systemic 
risk, capturing the effect of hre sales in response to a shock on the price of the 
assets. This is described by the K dimensional vector —e = (—£i,..., —£k)-i whose 
components are the assets’ shocks. They define: 


• Aggregate vulnerability AV as [...] the percentage of aggregate bank eguity 
that would be wiped out by bank deleveraging if there was a shock [...] to asset 
returns. 

• Bank systemicness Sn as the contribution of bank n to aggregate vulnera¬ 
bility. 

• Bank’s indirect vulnerability IVn as [...]as the impact of the shock on its 
eguity through the deleveraging of other banks. 


By assuming that banks follow the practice of leverage targeting and that, in re¬ 
sponse to a negative asset shock, they sell assets proportionally to their pre-shock 
holdings. Greenwood et ah (2015) show that Sn can be decomposed as 


Q — F jih R r 

n j-i -^n ' n? 

tj 


( 2 ) 


where E is the total equity, E = '^n=i 'f'n is the n-th element of the vector 
r = W £, i.e. the portfolio return of bank n due to the shock e, and 


K / N 


Bn = ^ I ^ Am Wm,k | 4 Wn,k- 


k=l \m=l 

The aggregate vulnerability is computed simply as 


N 


AV = Y,Sn. 


(3) 


n=l 


®The assumption of linear price impact comes directly from the framework of Greenwood et al. 
(|2015|. Although a square-root law hts the data better, the linear assumption has been widely 


adopted in the literature (see Gatheral et al., 2012 Gont and Wagalath |2014 Lillo and Pirino 


2015 

among others) and has been empirically validated at daily frequency by 

Obizhaeva 

00 

o 

o 
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Finally, the indirect vulnerability of a bank is 


K 


N 


IV„ = (1 + B^) Y, 4 Wn,k Y 


' r„/. 


(4) 


fc=i 


n'=l 


In what follows we assume, as in Duarte and Eisenbach (2013), that e*, = 1% for 


all k = which in turns implies that = 1% in equations ([^ and (|^. 

Note however that if all the assets are shocked by the same amount, our results do 
not depend on it, since the systemic risk measures will have only a different pre¬ 
factor. Note also that both systemicness and indirect vulnerability must be thought 
as quantities expressed in percentage. Finally, we set the liquidity parameter at 
Ik = 10“^° for all asset classes except for cash, for which we put = 0 (as in 


Greenwood et al. 

2015 

Duarte and Eisenbach 

2013 


In the next section we provide a detailed description of the dataset that we adopt 
in our analysis to measure systemic risk, as captured by the metrics of Greenwood! 
et al. (2015), in the US banking sector. Such a dataset allows us to have quarterly 


estimates of systemicness, aggregate, and indirect vulnerability and to compare 
these estimates with those inferred from the Cross-Entropy approach and the Max¬ 
imum Entropy principle. Since we have to deal with both real and reconstructed (or 
sampled form a statistical ensemble) networks, in order to avoid ambiguity, from 
now on we follow the convention to add a superscript x* to any variable x when¬ 
ever it is referred to a real (observed) network, while the variable x is represented 
without the superscript -k every time it is referred to a reconstructed network (e.g. 
one sampled from a statistical ensemble as described in Section]^. 


3 Data 

All regulated hnancial institutions in the United States are required to hie periodic 
hnancial information with their incumbent regulators. The Federal Financial In¬ 
stitutions Examination Council, is the regulatory institution responsible to collect 
and maintain the data used in our analysis. The hnancial institutions subject of 
our investigation are Commercial Banks and Savings and Loans Associations. The 
FFIEC dehnes officially a commercial bank a^ “/■ ■ 7 ® financial institution that is 
owned by stockholders, operates for a profit, and engages in various lending activi¬ 
ties^'. FFIEC requires commercial banks to hie the quarterly Consolidated Report of 
Condition and Income, generally referred to as Call Report. Each bank is required 
to hll a form with detailed information on its hnancial status, in particular its bal¬ 
ance sheet. The specihc reporting requirements depend upon the size of the bank 
and whether or not it has any foreign office. The form FFIEC031 is used for banks 
with both domestic (U.S.) and foreign (non-U.S.) offices while form FFIEC041 is 
designed for banks with domestic (U.S.) offices only. A Saving and Loan Associa¬ 
tion is a hnancial institution that accepts deposits primarily from individuals and 
channels its funds primarily into residential mortgage loans. From the hrst quarter 
of 2012, all Savings and Loan Associations are required to hie the same reports, 
thus they are included in the dataset since then. 

^See http://www.ffiec.gov/nicSearch/FAQ/Glossary.html 
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Figure 1: We report in the left panel the percentage of total assets detained by top 10, 
top 100, top 1000 and the remaining banks in shaded areas of different colors. A vast 
portion of total assets is controlled by the top 10 banks. In the right panel we report, for 
each quarters, the contribution of the top seven asset classes (in terms of capitalization) 
to the total capitalization. A large percentage to total asset capitalization is due to Loan 
Secured by Real Estates in Domestic Offices. 


The data provided by the Call reports are publicly available since 1986, although 
the form changed considerably throughout the years, showing an increasing level 
of details requested. To have a good compromise between the fine structure of 
data and a reasonably populated statistics we considered the time period going 
from March 2001 to September 2013, for a total of 55 quarters. The number of 
hnancial institutions present in the data is pretty stable during quarters, starting 
from approximately 9, 000 entities in the hrst quarter and ending in roughly 6, 500 
in the last one. The asset categories have been created as coherent sums of codes. 
We describe the procedure adopted to form asset classes in Appendix along with 
some data statistics. In particular, we aggregate data in a set of 20 asset classes 
following the rationale of Duarte and Eisenbach (2013), that is each of the 20 asset 
classes is composed in such a way that, in case of a hre sale of assets belonging 
to a specific class, the price impact would be restricted mainly to the assets in 
the same class. In other words, it is reasonable to assume that the co-illiquidity 
of two different asset classes is negligible. The twenty macro asset classes used to 
build the network are described in Table [T] of Appendix which also documents 
in detail how they have been formed. In the left panel of Figure [T] we show how the 
total asset value is concentrated on the top tiered banks. The right panel of Figure 
shows the relative importance of the top seven assets classes (in terms of total 
capitalization), revealing that a large portion of the total capitalization is due to 
Loan secured by real estates in domestic offices. 


Hence, we are able to construct, for each quarter form the hrst of 2001 to the third 
of 2014, a matrix X* of bank holding whose element X* ^ is the total dollars invested 
by the n-th bank in the /c-th asset class. Finally, it is important to note that the 
matrix X* has around 50% of zero entries. Thus the network is relatively dense, 
but far from being fully connected: it is not true that each bank invest in all 20 


®See 

https://www.chicagofed.org/banking/financial-institution-reports/ 

commercial-bank 

-data. 
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assets classes. 


4 The Cross-Entropy Approach 


As anticipated in the introdnction, Cross-Entropy is a popnlar method, largely 
adopted by scholars and researchers of central banks, nsed to reconstrnct a target 
matrix (typically the inter-bank matrix) from partial knowledge of its properties. 
The idea is to select an a priori gness for the matrix and then to hnd its closest 
matrix snbject to some constraints. The gness matrix is chosen either by nsing some 
randomness assnmption or by nsing some economic intnition. The constraints are 
nsed to enforce all the partial knowledge on the target matrix to be estimated. In the 
simplest case, snch constraints are non negativity conditions of matrix elements and 
the total row and colnmn snms. Finally, as a measnre of distance to be minimized 
between the gness and the target matrix one nses the Knllback-Leibler divergence. 

For the specihc case of the system of bank holdings for US commercial banks we 
assnme to have at our disposal, for each quarter, only the information on the total 
asset size A* for the n-th bank and the total capitalization for the fc-th asset 
class, as provided by the FFIEC data. The Cross-Entropy approach derives the 
target matrix X as that which solves the optimization problem 


min 

X 

N K 

EE-’f” 

n=l k=l 

,k log 

^n,k ] 

^^n,k j 

1 


N 





s.t. 

E 


a;, n 

= 1 ,.. 

.,N, 


n=l 






K 






E 

^n,k 

Cl, k 

= 1 ,.., 

.,K, 


k=l 







IV 





( 5 ) 


where Xn,k are the entries of the guess matrix. Note that the cases analyzed in the 
interbank lending literature (Mistrulli, 2011) typically have an additional constraint 
on the diagonal elements, required to avoid a single institution to be simultaneously 
a borrower and lender to itself (see, for example, the Appendix B in Mistrulli, 2011). 
The matrix of portfolio holdings analyzed here does not require any of such kind 
of restrictions, a feature that extremely simplihes the solution of problem ([^. In 
fact, the initial guess as proposed, for example, by Mistrulli (2011) coincides, in 
our settings, with the capital asset pricing model (CAPM). In a standard CAPM, 
investors choose their portfolio in such a way that each weight on a stock is the 
fraction of that stock’s market value relative to the total market value of all stocks 

Since A* is the total asset size of the 


Sharpe 

1964 

Lintner 

1965 

Mossin 

1966 


n-th bank and since the total market value of all stocks is given by L* = Efc=i 
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the CAPM portfolio weights are given bj|^ 


C* 

^CAPM A-k 

^n,k ^n' 


( 6 ) 


The CAPM-implied holdings are a natnral choice for the initial guess fc, 

since they are the simplest combinations of the strength sequences that verify the 
constraints. Nevertheless, given that in (|^ the condition on the diagonal elements 
is absent and since the Kullback-Leibler divergence is always positive, the optimal 
solution of the Cross-Entropy problem in (|^ when ^ = X^^^ is nothing but the 
X^^^ itself. Hence, thanks to the bipartite nature of the network under study we 
do no have to resort to numerical routines to solve problem (|^. If other constraints 
are added to the problem (e.g. that some banks cannot invest in some asset classes) 
one could numerically solve the problem ([^ with the additional constraint. 

We now test empirically the validity of the CAPM-implied portfolio holdings on 
our data. Notice that the true matrix X is quite different from X, since the latter 
has no vanishing elements, while in the former half of the matrix elements are zero. 
Therefore it is not a priori obvious that the CAPM-implied matrix is able to capture 
the dynamics of systemic risk. This is however the case, as clearly witnessed by the 
plot in Figure where we compare the true value of the aggregated vulnerability 
(formula (j^ of the previous section) as found in the dataset described in Section 
with that computed on the CAPM-implied portfolio holdings. 


The main implication of Figure is that, at least for the dataset under analysis, it 
is not necessary to know the matrix X* ^ to assess the systemic risk as measured by 
the aggregate vulnerability. The knowledge of banks’ size and assets’ capitalization 
is enough to infer the matrix X^^^, which very well reproduces the aggregate 
behavior (in terms of systemicness) of the system. This is different from the result 
of Mistrulli (2011) for the interbank network, since he hnds that the Cross-Entropy 
approach signihcantly underestimates systemic risk, while in our case the bias is 
negligible. 


At this stage two main issues remain to be addressed. The hrst is how accurate is the 
estimate of systemicness and indirect vulnerability of each single bank as provided 
by the X^^^ matrix and, the second is which kind of model can be adopted to 
statistically validate a variation of systemicness (or indirect vulnerability) for a 
specihc bank. In what follows, we turn to these issues. 


5 Network Statistical Models 


While providing a very accurate approximation of aggregate vulnerability, the 
matrix cannot say much about statistical signihcance of systemicness and indirect 
vulnerability of banks. For this purpose a statistical null model is required. 


We dehne a statistical network model following the theoretical framework of Ko- 


laczyk (2009). A network (statistical) model is dehned by a set X of graphs that is 


®More precisely, we are deriving what can be addressed as a “banking-CAPM” since, as men¬ 
tioned before, represents the total amount of asset’s k capitalization due to the banking sector 
and not its total market capitalization. 
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Figure 2: This figure reports as a black continuous line the aggregated vulnerabil¬ 
ity, as defined by equation (|^, computed on the matrix ^ of portfolio holdings 
as provided by the FFIEC dataset of US commercial bank holdings, described in 
Section The red dotted line is the aggregated vulnerability computed on the 
CAPM-implied matrix defined in equation ([^ and that requires only the informa¬ 
tion on the strength sequences A* and C^. 


called ensemble and a probability mass function indexed by a vector of model 
parameters i?. In formula it is expressed as the triplet 


{P^, A, -J? e H }, 


where S is a convex subset of with P the total number of parameters of the 
model. The set A is a countable set whose element are called graphs. In what 
follows, we will not distinguish between the graph and the associated matrix X, 
i.e. the probability mass function is defined in the space of integer valued matrices. 
Moreover the probability mass function P.,j [X] is a function defined on X with value 
in [0,1] 

P^ : A —>■ [0,1] , 

and such that ~ Such a probability is allowed to depend on 

a vector of real parameters i? G 5. For an arbitrary (regular enough) network 
function F : A —)■ M defined on the set X (see for example the functions defined in 
equations ([I|-(p)^), the expected value of F on the ensemble X is defined as 

[^] = 

A model can be defined by explicitly giving the ensemble, the probability mass 
function along with the space 5 of the parameters, or by deriving P^ [X] through 
the recurrent application of some generative mechanism or rule, either starting from 
an empty graph or by applying a randomization procedure to a reference graph. 
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We define our network statistical models in the next subsections by invoking the 
Maximum Entropy principle. However, before proceeding, it is important to stress 
that, once a network model has been defined, there are at least two possible applica¬ 
tions. First, in every situation in which solely the partial information inputed in the 
maximization constraints is available (and hence the true matrix X* is unknown), 
the network model can be used to estimate the true (unobserved) value of any net¬ 
work function F (X*) (e.g. systemicness and indirect vulnerability of banks), by 
proxying it via, for example, [E]. Second, as mentioned before, when the entire 
matrix X* of bank holdings is known, we can use the associated P,? (X) to statisti¬ 
cally validate any network function F (X*) as observed on real data, for example by 
testing whether the systemicness Sn (X*) of a specihc bank has statistically grown 
in time. The hrst application is discussed in Section while the second in Section 

m 

5.1 The Maximum Entropy Principle 

One way to construct ensembles of networks is by invoking the Maximum Entropy 
(ME) principl^^ In the general setting, the probability mass function P^ is the 
one that maximizes the Shannon’s entropy 

5 = -5^P^[X] log(P^ [X]) 
xev 

with the normalization constraint 


E IX] = 1. 

xeA' 

and further additional constraints may be added to dehne a specific ensemble (see 
below). In our case, given the peculiar role of the CAPM in reproducing the ag¬ 
gregate vulnerability, we derive the probability mass function P^ by solving the 
maximization problem 


max 

- 5^P^[X] log(P^ [X]) 
xeA’ 


s.t. 

5^P,|X] = l 

xeA^ 

n = l,...,iv, k = l,...,K. 

(7) 


We call this model Maximum Entropy Capital Asset Pricing Model (shortened in 
MECAPM henceforth). In Appendix B.l we prove that the MECAPM has the 
unique solution 


i^We remember that, beyond the the statistical validation of networks, a relevant application 
of the ME principle is the estimation of probability densities, as it is witnessed by the vast 
econometric literature that hinges on ME (see, among others, the contributions by Zellner and 
Hlghfieldl |1988| |Ryul |1993| |Wu| [2003) |Kouskoulas et al.[ |2004| |Park and Bera| |2009l |Usta and 
Kantarl 120111 iChenl 20151). 
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[X] = 


N K 

nn 

n=l k=l 


■V^CAPM 
^n,k 

1 + 


X, 


n,k 


1 + ^nT"’ 


( 8 ) 


hence each single matrix entry Xn,k is geometrically distribnted with mean 

Since other specihcations of maximnm entropy are qnite popnlar in the literatnre of 
network reconstrnction, only for comparison purposes we take into considerations 


two other ensembles, mainly inspired by the paper by Mastrandrea et al. (2014) 


and Saracco et al. (2015). Each of them is characterized by different constraints 


imposed on the maximization of the Shannon’s entropy. 


In the hrst, whose probably mass function is found in Appendix B.3 and that we 
refer as Bipartite Weighted Conhguration Model, the constrained maximization is 
modihed in 


max 

P o 


[X] log(P^ 

[X]) 


xex 



s.t. 


] = 1 



xex 




Ea” [A„] = 

A*, n = 1, 



[Ck] = 

Cl, k = 1, 



BIPWCM imposes weaker constraints with respect to MECAPM, while exploiting 
the same information set. Despite this, MECAPM has a fully analytical solution 
while BIPWCM has not. As shown in the Appendix, it is possible to write a 
system of equations for the Lagrange multipliers of the optimization problem and 
the system is easily solved numerically. 


Finally, we consider another (richer) statistical ensemble whose probability mass 
function, derived in Appendix B.3[ corresponds in ou r bipartite framework to the 
enhanced conhguration model of Mastrandrea et al. (2014). This newly dehned 
ensemble, that we address as Bipartite Enhanced Conhguration Model (BIPECM), 
is obtained via Maximum Entropy imposing both the mean value of strengths (as 
in BIPWCM) and the mean value of degrees, that is the number of edges incident 
in each vertex. In other words, we reconstruct the matrix by assuming the knowl¬ 
edge of the number of assets in which each bank invests as well as the number of 
banks investing in each asset. Despite the fact that this information is typically 
not known, we consider this ensemble to show that even with an information set 
signihcantly larger than the one used in MECAPM it is very difficult to outperform 
it. Mathematically, the BIPECM is obtained by solving the optimization problem 
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max 


s.t. 


-ZPo 

[X] log 

(E^ [X]) 


xea 





.] = 1 



xea 




Ea’ [An] = 




ea [Dr] 

_ ^row’' 

, n = 1,, 

...,W 

Ea [Ck] = 

Cl 



1-1 

'o 

= 

, k = l,.. 

..,x. 


( 9 ) 


where -D™" and are, respectively, the row and the column degree sequences 
(see Appendix B.3 for more details). The peculiarity of BIPECM is the addition 


of the information on the degree sequences that is absent in both BIPWCM and 
MECAPM. Note that the three ensembles can be used not only for statistical in¬ 
ferences, but to produce estimates of any function dehned on the network, which is 
the topic of the next section. 


6 Assessing systemic risk for individual banks 

In this Section we compare the performance of the three different ensembles pre¬ 
sented above in order to assess systemic risk for individual banks. The three esti¬ 
mators are 


• MECAPM estimator: only the information on the strength sequences is used, 
systemicness and indirect vulnerability are estimated through their expected 
values according to the probability mass function in ([^. 


BIPWCM estimator: as for the MECAPM case only the information on the 
strength sequences is used, nevertheless the constraints imposed on the max¬ 
imization are simpler. Systemicness and indirect vulnerability are estimated 
through their expected values according to the probability mass function in 

( 0 . 

BIPECM estimator: the information on the strength and degree sequences 
is used, systemicness and indirect vulnerability are estimated through their 


expected values according to the probability mass function in (23) 


Notice that, for each estimator, there are at least two possible points of comparison. 
Suppose, for instance, that we want to estimate S*, that is the systemicness of bank 
n. We could either proxy S* via [“Sn (X)] or S'„ {Kx [X]), where E,x [X] is the 
expected adjacency matrix according to the network probability distribution. 

A peculiarity of the MECAPM is that the computation of the distribution of each 
single entry Xn,k does not require any numerical procedure, and hence we can pro¬ 
vide explicit formulas for both Ex [S'n] and Ex [IV„]. In Appendixj^we derive these 
formulas and, in particular, we show that the expected values Ex [S'„] and Ex [IV^] 
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are well approximated by, respectively, S'„ {E^ (X)) and IV„ {E^ (X)) which, for 
the MECAPM case, coincides with the systemicness and indirect vnlnerability of 
bank n as retnrned by the CAPM-implied matrix. This resnlt shows that the Cross- 
Entropy and the MECAPM are almost eqnivalent, at least for what concerns the 
estimates of the average risk metrics. 

In evalnating the performances of each class of estimators it is important to have 
good estimates for the most systemic/vnlnerable banks, being all the remaining ones 
not particnlarly relevant in the context of systemic risk. For this reason we proceed 
as follows. Snppo^we want to assess the performance of an estimator that prodnces 
estimates Sn and IV^ of, respectively, systemicness and indirect vnlnerability of the 
n-th bank. For each bank n and for each qnarter we compnte the relative error in 
estimating both and IV„ as 


Sn 


Sn-Sl 


s 


★ 

n 


'^n 


IV n - IV 


★ 

n 


IV 


-k 

n 


( 10 ) 


We then divide the whole sample of banks in fonr qnartiles, according to the true 
bank systemicness (or the true indirect vulnerability) in the investigated quarter 
and we compute the relative errors (10) for all the banks present in the quartile and 
according to each of the three estimation methods. We finally plot the median of 
the relative error and, as a measure of dispersion, we take record of the interquartile 
range, i.e. the difference between the upper and lower quartile^^ Clearly a median 
well centered around zero is an indication that the estimator is unbiased. 


Figure shows the results for bank systemicness and Figure shows the results 
for indirect vulnerability. The three different colors refer to the three different 
estimation methods (see the figure caption for more details). We observe in all 
panels and for each quarter that BIPWCM strongly underestimates individual bank 
systemicness and indirect vulnerability. The median relative error ranges between 
—80% and —50% and the interquartile range includes zero only for the first quartile, 
i.e. for the least systemic/vulnerable banks, those who are of less interest in systemic 
risk assessment. The estimator based on BIPECM (using the additional information 
on degrees) gives slightly better results, even if a strong underestimation is still 
present. The median relative error ranges between —60% and —25% and again the 
interquartile range includes zero only for banks in the first quartile. 

On the contrary the estimator based on MECAPM performs much better. The 
median relative error never goes below —30% and almost always the interquartile 
range is centered around zero. The most notable exceptions refer to the hrst quartile 
(top left panels), which include least important banks and the fourth quartile of 
indirect vulnerability. In this important case the median relative error is around 
—25%-^30%. Even if we do not have an explanation for this behavior, it is important 
to notice that MECAPM strongly outperforms the other two estimators. 

In summary, the estimates of systemicness and indirect vulnerably for each single 
bank as provided by the CAPM-implied matrix are almost identical to those ob¬ 
tained as the corresponding expected values on the MECAPM ensemble. Besides, 

choose these metrics because they are more robust and less sensitive to outliers. 
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Figure 3: Time series of the relative error (see Eq. [Tol ) of bank systemicness with respect 
to real data as estimated by the three ensembles BIPWCM (red and squares), BIPECM 
(blue and circles), and MECAPM (grey and dashed line). The thick lines indicate the 
median and the colored areas the interquartile range (red cross for BIPWCM, blue squares 
for BIPECM and dotted line for MECAPM). The four panels refer to four quartiles of 
banks according to their systemicness. First quartile includes the banks with smallest 
systemicness. 


they are satisfactorily accurate and surely more reliable than those provided by 
standard maximum entropy ensembles. 

Once more, the important message is that it is possible to achieve pretty accu¬ 
rate estimates of systemic risk metrics, at the aggregate or individual institution 
level, due to hre sales spillover without a full knowledge of the portfolio holdings of 
hnancial institutions. 


7 Monitoring and testing changes in systemicness 

As another application of the ensembles of graphs obtained with the Maximum En¬ 
tropy method, we consider here the problem of assessing whether the systemicness 
of a given bank (or of the whole system) has changed in a statistically signihcant 
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Figure 4: Time series of the relative error (see Eq. 10) of bank indirect vulnerability 


with respect to real data as estimated by the three ensembles BIPWCM (red and squares), 
BIPECM (blue and circles), and WCAPM (grey and dashed line). The lines with symbols 
indicate the median and the colored areas the interquartile range (red cross for BIPWCM, 
blue squares for BIPECM and dotted line for MECAPM). The four panels refer to four 
quartiles of banks according to their vulnerability. First quartile includes the banks with 
smallest indirect vulnerability. 








































way. In order to answer this question, it is necessary to have a null hypothesis 
and we propose to use network ensembles to this end. Since the MECAPM shows 
superior performances in estimating risk metrics, in this section we use it and we 
propose a possible application for statistical validation. Our objective here is not to 
study all the banks and all the quarters, but only to show how the testing method 
can be implemented. 

In particular, imagine a regulator who monitors a given bank, measuring its sys¬ 
temicness and searching for evidences of a significant increase. Having a given 
quarter as reference, the regulator can extract the distribution of bank’s systemic¬ 
ness and, in the subsequent quarters, identify when the systemicness is outside a 
given conhdence interval around the reference period. As a special case, we select 
four banks among the top fifty in the first quarter and that exist for the entire time 
period (i.e. they do not exit the dataset). For each quarter we compute the true 
bank systemicness and the 5%-95% conhdence bands according to the MECAPM 
ensemble (see Figure]^. We then added a magenta square in each quarter when 
the true systemicness is above the 95% conhdence band of the hrst quarter, used as 
reference. Hence, a magenta square is indicating a quarter when the systemicness 
of the bank is statistically larger (according to the MECAPM) than at beginning of 
2001. We show two banks for which a statistically signihcant change in systemic¬ 
ness is observed (top row) and two for which no change is observed (bottom row). 
Notably, for the former case we hnd that the systemicness of the banks analyzed 
increased signihcantly much before the onset of the 2007-2008 hnancial crisis. This 
phenomenon persisted along the entire period of the crisis and vanished not before 
the end of 2009. This suggests that network statistical models could be of valuable 
help in the surveillance activity of central banks and other supervisory authorities 
as monitoring tools and in constructing early warning indicators. 


8 Conclusions 


In this paper we focused on the problem of estimating metrics of systemic risk due to 
fire sale spillover in presence of limited information on the composition of portfolios 
of hnancial institutions. A full knowledge of the portfolio holdings of each institu¬ 
tion in the economy is required to have a precise estimate of any risk metrics that, as 


those proposed by Greenwood et ah (2015), is based on the mechanism of portfolio 


rebalancing through hre sales. Nevertheless, such a huge and detailed information 
may not be available, especially at frequency higher than quarterly, making the 
estimation of systemic risk quite difficult. In this paper we circumvent the problem 
by providing accurate estimates of systemic risk metrics that are based on a partial 
knowledge of the system, more precisely only on the sizes of balance sheets and the 
capitalization of assets (or asset classes), which are much easier to trace. In this 
respect, we have shown that the largely diffused method of Cross-Entropy minimiza¬ 
tion returns, for the system under analysis, the Capital Asset Pricing Model and 
that it does a very good job in estimating aggregate vulnerability without requiring 
any knowledge of the underlying matrix of bank portfolio holdings. Furthermore, 
we introduced a Maximum Entropy (ME) ensemble that reproduces, on average, the 
CAPM and performs quite well in estimating systemicness and indirect vulnerabil- 
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Figure 5: We report, for four selected banks, the true systemicness (thick dotted 
lines) and the 5%-95% conhdence bands according to the MECAPM ensemble. A 
magenta square is added in every quarter in which the systemicness of the bank is 
above the 95% conhdence level of the hrst quarter of 2001. 























ity of single institutions, outperforming standard ME competitors. The estimation 
of systemic risk metrics could provide valuable information to any policy maker, 
but variations in systemicness and indirect vulnerability are difficult to interpret 
in absence of a statistical validation. For this reason, as a hnal contribution, we 
have proposed the ME ensemble as a tool to assess the statistical significance of 
systemic risk metrics. On a selection of banks of our dataset we documented that 
their systemicness significantly increased, with respect to the level observed at the 
beginning of the 2001, much before the onset of the 2007-2008 hnancial crisis. Even 
if deeper investigations are required in this direction, we believe that this approach 
could be easily implemented as an early warning indicator of systemic risk. 
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A Data Description and Dataset Creation 


This appendix provides some descriptive features of the data along with the method 
adopted to build the 20 asset classes of the bank-asset network analyzed in the 
paper. The left panel (first row) of Figure reports, on a log-log scale, the kernel 
density of the bank sizes (i.e. the total amount of assets detained by the bank) 
pooled across all quarters. It is evident that bank sizes are quite heterogeneous. 
The right panel (hrst row) of Figure reports the density of the bank leverages 
Bn pooled across all quarters. In this case we observe a much less heterogeneous 
distribution, with most banks showing a leverage around 10. Finally, the second row 
of Figure [^reports the relation between size and leverage. The plot is achieved by 
sorting all records of bank size from the smallest to the largest and then applying a 
moving-window procedure. As expected from the density plots, there is no relation 
between leverage and bank size, having most bank a leverage of 10 and a highly 
heterogeneous size. 

Concerning the formation of the asset classes used in the main text, we provide in 
what follows details on how they have been created. As mentioned in the main text, 
the focus of the paper is on commercial banks, whose precise definition is given by 
the FFIEC as [...] every national bank, state member bank, insured state nonmember 
bank, and savings association is required to file a consolidated Call Report normally 
as of the close of business on the last calendar day of each calendar quarter, i.e., 
the report date. The specific reporting requirements depend upon the size of the bank 
and whether it has any ‘foreign” offices [...]. This is the set of institutions that is 
referred as Commercial Banks throughout all the paper. 

Forms FFIEC031 and FFIEC041 are dedicated to, respectively, banks with only do¬ 
mestic offices and banks with domestic and foreign offices. However, in both forms, 
it is adopted the same coding system. More specifically there are only two types 
of codes, RCON and RCFD, which are followed by a four digits alphanumerical 
code. The alphanumerical code identifies the budget item, for example 2170 refers 
to total assets of the bank. The prefix RCON is used for financial items relative to 
domestic offices, while RCFD encompasses both domestic and foreign offices. Hence 
RCON2170 is the code for the total assets of the bank detained in U.S. offices, while 
RCFD2170 is relative to the sum of total assets detained in U.S. plus offices abroad. 
Of course, for banks that £11 the FFIEC031 the two codes RCON and RCFD report 
the same value if they have the same alphanumerical code. 

Table [^documents the detailed composition of each asset class. For each asset class 
(first column) we report the composition in terms of FFEIC items in the third col¬ 
umn and a short name given to the asset class in the second one. Such abbreviation 
is needed since some asset class, e.g. “loans to consumers in foreign offices”, are 
assembled subtracting from the FFIEC codes some previously defined asset classes. 
There is a one-to-one correspondence between asset classes and variable names, a 
part for the case of “loans secured by real estates in domestic offices”, which is 
computed as the sum of five variables, from “construction loans” to “non farm, non 
residential”. The composition of the FFEIC formula reported in the third column 
may vary during time, hence we report in bold the period of validity of the formula 
adopted. In this respect, note that the date 12/99 refers to the last available quar- 
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ter, that is the third quarter of 2014. In reporting the FFEIC formula, we adopt 
the convention that the prefix is omitted whenever RCON is used solely for banks 
with only domestic offices and RCFD solely for those that have at least on office 
abroad. On the contrary, when the prefix is specified, it means that only the code 
with that particular prefix is being been used. For example the code RCON3532 is 
used only in its domestic version, hence we do not use RCFD3532 for banks with 
offices abroad. 
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Size Leverage 



Mean Size 


Figure 6: This Figure reports some descriptive features of the data anaiyzed. Top ieft panei piots, on a 
iog-iog scaie, the kernei density of bank sizes (defined as totai assets in unit of 10^$) whiie top right is 
the kernei density of the bank ieverages. Both densities are computed using aii records pooied across the 
entire time span. For the sake of visuaiization, we put a cut-off of 50 on the maximum ieverage aiiowed, 
aithough ieverages of more than 150 are (rareiy) observed. The bottom panei shows that there is no reiation 
between ieverage and size. The procedure adopted to draw the piot is the foiiowing: aii records of bank 
size are sorted from the smaiiest to the iargest one and a roiiing window of 1000 records is moved, with an 
incrementai shift of 10 records, from the hrst to the iast. In each window we compute the mean ieverage 
(biack continuous hne) and the standard deviation of ieverage (red dotted hire) of banks that faii in the 
window. Mean and standard deviation are piotted as a function of the mean size in the window, which is 
reported in the horizontai axis. 
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Table 1: Composition of Asset Classes 


Asset Class 

Variable Name 

FFIEC Formula 

Total assets 

tot_ass 

03/01-12/99: 2170+2123-13123 

Ecjuity 

equity 

03/01-03/09: 3210+3000 

03/09-12/99: G105 

Cash and balances due from depository institu¬ 
tions 

cahab 

03/01-12/99: 0081+0071 

U.S. treasury securities 

ust_sec 

03/01-12/99: 0211+1287+RCON3531 

U.S agency securities 

agency _sec 

03/11-12/99: 1289+1294+1293+1298+RCON3532 

Securities issued by state and local governments 

state_sec 

03/01-12/99: 8496+8499+RCON3533 

Mortgage backed securities 

mbs 

03/01-03/09: 1698+1702+1703+1707+1709+1713+1714+1717+1718+1732+ 
1733+1736+RCON3534+RCON3535+RCON3536. 

06/09-12/10: G300+G303+G304+G307+G308+G311+G312+G315+G316+ 
G319+G320+G323+G324+G327+G328+G331+RCONG379+RCONG380+ 

RCONG381+RCONG382 

03/11-12/99: G300+G303+G304+G307+G308+G311+G312+G315+G316+ 
G319+G320+G323+K142+K146+K145+ 

K149+K150+K154+K153+K157+ 

RCONG379+RCONG380+RCONG381+RCONK197+RCONK198 

Asset backed securities 

abs 

03/01-12/05: B838+B841+B842+B845+B846+B849+B850+B853+B854+ 
B857+B858+B861 

03/06-03/09 C026+C027 

06/09-12/99: 

C026+C027+G336+G340+G344+G339+G343+G347 

Other domestic debt securities 

dom_debt_oth_sec 

03/01-12/99: 1737+1741 

Foreign debt securities 

for_debt_sec 

03/01-12/99: 1742+1746 

Residual securities 

res_sec 

03/01-12/99: A511 

Futures, forwards sold and securities purchased 
under the agreement to resell (asset) 

ffrepo_ass 

03/01-12/01: 1350 

03/02-12/99: RCONB987+B989 


Construction loans 

03/01-12/07: RCON1415 

03/08-12/99: RCONF158+RCONF159 

Loans secured by real estates in domestic offices 

Secured by farmland 

03/01-12/99: RCON1420 


1-4 Family real estate 

03/01-12/99: RCON5367+RCON5368+RCON1797 


Multifamily property loans 

03/01-12/99: RCON1460 


Non farm, non residential 

03/01-12/07: RCON1480 

03/08-12/99: RCONF160+RCONF161 

Loans secured by real estate in foreign offices 

ln_re_for 

03/01-12/99: (if preHent) RCFD1410 - ln_const - 
ln_farm - ln_rre - ln_nmlti - ln_iifnr. 

3/01-12/99: (otherwise) zero 

Commercial and industrial loans in domestic of¬ 
fices 

ln_ci_dom 

03/01-12/99: RCON1766 

Commercial and industrial loans in foreign offices 

ln_ci_for 

03/01-12/99: (if present) RFCD1763+RFCD1764 - RCON1766, 

03/01-12/99: (otherwise) zero 

Loans to consumers in domestic offices 

ln_cons_dom 

03/01-12/10: RCON2011+RCONB538+RCONB539 

03/11-12/99: +RCONB538+RCONB539+RCONK137+RCONK207 

Loans to consumers in foreign offices 

ln_cons_for 

03/01-12/10: (it present) RCFD2011+ RCFDB538+ 

RCFDB539 - Irneonsniom, (otherwise) zero 

03/11-12/99 (it present) RCFDB538+RCFDB539+ 
RCFDK137+RCFDK207-hi_cons_doni, (otherwise) zero 

Loans to depository institutions and acceptances 
of other banks 

ln_dep_inst_banks 

03/01-12/99: (it present) RCFDB532+RCFDB533+RCFDB534+ 
RCFDB536+RCFDB537. (otherwise) RCON1288 

other loans 

othJoans 

03/01-12/99: 2122+2123-ln_const-hi_farm-hi_rre-ln_multi-ln_nfnr- 
ln_re_for-ln_ci_dom-ln_ci_for-ln_cons_dom-ln_cons_for-ln_dep_inst_banks 

Equity securities that do not have readily deter¬ 
minable fair value 

equ_sec_nondet 

03/01-12/99: 1752 

other assets 

oth_ass 

03/01-12/99: tot_ass - all preceding assets 





B The Maximum Entropy Approach 

In this appendix we provide the details of the derivation of the probability mass 
fnnctions for the MECAPM, BIPWCM and BIPECM ensembles. In what follows 
we indicate with S (P^ [X]) the entropy fnnction 

^ (P^ [X]) = - P^ [X] log (P^ [X]). 
xev 


B.l Maximum Entropy Capital Asset Pricing Model 

The maximum entropy distribution P^ is obtained by Ending the distribution that 
solves the constrained maximization 


max S (P .,9 [X]) 

P,,(X) 


s.t. = 


xev 


A* a 


Ev [Xn,k] = n = l,...,N, k = l,...,K. 


C = ^(P,9[X]) + a 1- ^ p,9 (X) I 


A* r'* 


The Lagrangian associated to the problem is written as 

N K 

\ 

.,k 

xev / n=i k=i \ ~ xev 

where a and Xn,k are Lagrange multipliers. Taking the first derivative we get 

N K 




dL 


9P,9 (X) 

whose solution is 


= -log(P ,9 (X))-l-a Xn,k X^i^k 0, 


n=l fc=l 


P,9 (X) = 




where (X) is the function ; A —)■ M defined as 

N K 

H^{X) = EE Xn,kXn^lc, 

n= 

and is a normalizing factor given by 


n=l fc=l 
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Note that the partition function in ( [II| is such that 


d\ 


n,k 


( 11 ) 


( 12 ) 


Hence the Lagrange multipliers are determined by 


aiogZ^ 


dX 


n^k 


L* ’ 


which gives 


N K / ^CAPM 

p«(x)=nn(rTfsAPM 


X„ 


n=lk=l 


1 + XCAPM / 1 


where 


4* C* 

^CAPM _ '^k 


^n.k 


(13) 


B.2 Bipartite Weighted Configuration Model. 

The maximum entropy distribution is obtained by hnding the distribution that 
solves the constrained maximization 


max S (P^ [X]) 

P^(X) 

s.t. P^ [X] = 1, 

xeA 

Ex [An] = A*, n = 1, 

Ex[Ck] = Cl k = l,...,K. 


Using one Lagrange multiplier for each quantity indexed by n, one Lagrange 
multiplier pk for each quantity indexed by k and one, a, for the constraint on 
the total probability XIxea (^) “ Lagrangian function that has to be 


maximized is written as (Park and Newman, 2004) 


C = 5(P^[X]) + a 1- 5^P^(X) 


xeA 


N / 

( 

n=l \ 


H* - P^ (X) 

xeA 


Ain (X) + 
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which taking its hrst derivatives becomes 


5P,9 (X) 


- log(P^ (X)) -1 - a - ^XnAn-^VkCk = 0, 


whose solution is 


P ,9 fX) = 




where H,<f fX) is the a function if ,9 ; df —)■ M defined as 


ff^(X) (X) + E%C'fc(X), 


and is a normalizing factor given by 


Zo=Z. 




We can go on with computation by explicitly writing the expression of (X) and 
Ck (X) in terms of the elements of the matrix X, obtaining 

N K 

(X) = E] E/ ’ 

n=l k=l 


- Enn 
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N K 00 
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) ^n,k (-^n ~\~Vk) 


M K MK 

nn X _ g-(A„ +%) =nnrv^. 

n=l k=l n=l k=l 


where cpn = ^ and ^k = ^ whence 


P,9fX) = 


n^-i IIa^i 6“^’"-'= An +m) 

rr^ _1_ 

lln=l llA:=l 
N K 
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The value of the Lagrange multipliers are determined by imposing that the expected 
value of An (X) and Ck (X) on the ensemble X are equal to, respectively, A* and 
C^. As for the MECAPM case, note the partition function in (16) is such that 

dlog{Z^) 


d\ 




and similarly 


[Ck] = 


— [An\ , 

d\og{Z^) 

dr]k 


Hence we can compute [A„] and [Cfc] explicitly as a function of the Lagrange 
multipliers, that is 


d 


N K 


log (1 


3 —(An +'rjk 


^ o-An+Vk) 


dXn 

E 

k=l 

d 


>) 


n=l k=l 

e~ 


\ — g — (An +Vk) ’ 


N K 


= ^EE'“s(i 

n=l k=l 
g-(An+»?fe) 

I _ g-(An +r}k) ■ 


o-Ar 


+Vk)^ 


n=l 


Therefore the Lagrange multipliers are determined by numerically solving the 
linear system of equations 


jy 


= 

n = 

1,. 


k=l 


.N 

^k 

= Cl, 

k = 

1,- 

..,K. 

n=l 

^k 


non- 


(18) 


B.3 Bipartite Enhanced Configuration Model. 

The only difference with the Weighted model described in Appendix |B.2| is the ad¬ 
dition of the constraints on the number of degrees for each node. Before proceeding, 
we have thus to add some additional definitions. 

The binary projection of Xn,k is dehned as the matrix W,A; = lx„fe>o- Accordingly, 
the number T*™'" of assets in which the n-th bank invests and the number of 
banks that own the fc-th asset class are computed as 

K N 

D'r (X) = E (X) = E (1**) 

k=l n=l 

where the capital letter D stands for degree, as it is common practice in network 
theorjj^ 

have chosen not to use a different notation for bank and asset degrees since, although they 
have an immediate economic interpretation, they are not frequently used in economic analysis. 
In fact, while and Ck are typically publicly available, the degree sequences D™ and are 
instead difficult to trace. 












The maximization problem for the BIPECM case is hence stated as 


max S (P^ [X]) 
s.t. 1 

E;t [An] = A*, n = 

E;^[Ck] = Cl k = l,...,K, 

E;, [Df] =Df\ k = l,...,K. 

The Lagrangian function is written accordingly as 


N 




C = 5(P^[X]) + a 1- ^ P^(X) ^ P^(X)T„(X) ) + 

V X&X / n=l V 

+E P" (o”’* - E *’•» (X)) + 

n=l V X&X / 

+Y.^k(ci- + 

k=l \ X£X J 

+ 4 f D-f (^) ’ 


k=l 


xeA' 


with an obvious extension of the number of Lagrange multipliers with respect to the 
Lagrangian in (14). The first order condition reads now 

dC 


= - log (P^ (X)) -1 - a - A, (X) - pn (X) - % (X) - 4 Df (X) = 0 


(X) 


Since (X) = lx„,fc>o (similarly, (X) = lx„,j,>o), we obtain as a so¬ 

lution of the first order condition, an expression for the probability mass function identical 
to that obtained in (15) for the Weighted Model, that is 




P^ (X) = 




( 20 ) 


with the precaution that, now, the function (X) and the normalizing factor are, 
respectively, given by 


N K 

(X) = EE [Xn,k (An + rjk) + bx„,fc>0 {Pn + 4)] , (21) 

n=l k=l 


and 


Z^ 


x&x 


N K , . 

Enn g— [^n,k i^n+'rik) + ^X^ f.>0 (Pn+^fc)j 


X&Xn=lk=l 
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Defining ipn = e , Cfc = e , V'n = e and 7 fc = e '^'= we get 

iv A" 


^•»=nn 


n=lfc=l 


1 - y’ngfc (1 - V’nTfc) 
1 


( 22 ) 


Finally, putting expression (22) and (21) into (20), we get 


N K 


(x)=nn 


{l-(Pn^k) {(fn^k) ii’nlk) 
1 fn ^k (1 '4^n 7fc) 


(23) 


n=l k=l 

The determination of the Lagrange multipliers follows a procedure identical to that de¬ 


scribed for BIPWCM in Appendix B.2, that is the expected values of (X), Ck (X), 
(X) and (X) are computed as the partial derivative of — log (Z^) with respect 
to the corresponding Lagrange multiplier and then equated to the observed value. This 
procedure produce the non-linear system of equations 
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= D^*, n = l,...,n, 
= C*, k = l,...,K, 




(24) 


C Explicit Formulas for Expected Values uuder 
MECAPM 


This appendix is dedicated to derive explicit formulas for the expected values of bank 
systemicness and indirect vulnerability under the MECAPM ensemble. Moreover we show 
that these expected values are well approximated by the corresponding values returned 
by the CAPM-implied matrix. First note that, according to equation (13), the element 
Xn k is geometrically distributed with parameter p = . , ^oapm and thus 


E [Xl2 - E = ^ = + 1) . 
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Hence the expected value of the systemicness of bank n is given b} 


E[5„(X)] = 
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Note that since 
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In order to evaluate the relative error above consider the simplified case in which all the 

'k ^ — K 

E [Sr, (X)] - Sr, (XCAPM) ^ ^ 


capitalization are almost equal Ct ^ C* = hence 
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where the last inequality follows from the fact that L* = 
Concerning the indirect vulnerability we have 


E[IV„(X)] = £il±^E 
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^^Remember that in our setting all the illiquidity parameters are set to a common value £ except 
for the cash, which we assume to be the first class fc = 1, for which it is set to zero, whence the 
sum form k = 2 to k = K. 
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